Accounting for Boundary Effects in Nearest - Neighbor

نویسنده

  • O. Narayan
چکیده

Given n data points in d-dimensional space, nearest-neighbor searching involves determining the nearest of these data points to a given query point. Most averagecase analyses of nearest-neighbor searching algorithms are made under the simplifying assumption that d is fixed and that n is so large relative to d that boundary effects can be ignored. This means that for any query point the statistical distribution of the data points surrounding it is independent of the location of the query point. However, in many applications of nearest-neighbor searching (such as data compression by vector quantization) this assumption is not met, since the number of data points n grows roughly as 2d . Largely for this reason, the actual performances of many nearest-neighbor algorithms tend to be much better than their theoretical analyses would suggest. We present evidence of why this is the case. We provide an accurate analysis of the number of cells visited in nearest-neighbor searching by the bucketing and k-d tree algorithms. We assume md points uniformly distributed in dimension d, where m is a fixed integer ≥2. Further, we assume that distances are measured in the L∞ metric. Our analysis is tight in the limit as d approaches infinity. Empirical evidence is presented showing that the analysis applies even in low dimensions. ∗ A preliminary version of this paper appeared in the Proceedings of the 11th Annual ACM Symposium on Computational Geometry, 1995, pp. 336–344. Part of this research was conducted while the first author was visiting the Max-Planck-Institut für Informatik, Saarbrücken, Germany. The first author was supported by the ESPRIT Basic Research Actions Program, under Contract No. 7141 (project ALCOM II). The support of the National Science Foundation under Grant CCR–9310705 is gratefully acknowledged by the second author. The third author was supported in part by AT&T Bell Laboratories and the Society of Fellows at Harvard University. 156 S. Arya, D. M. Mount, and O. Narayan

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Edge Detection Based On Nearest Neighbor Linear Cellular Automata Rules and Fuzzy Rule Based System

 Edge Detection is an important task for sharpening the boundary of images to detect the region of interest. This paper applies a linear cellular automata rules and a Mamdani Fuzzy inference model for edge detection in both monochromatic and the RGB images. In the uniform cellular automata a transition matrix has been developed for edge detection. The Results have been compared to the ...

متن کامل

Edge Detection Based On Nearest Neighbor Linear Cellular Automata Rules and Fuzzy Rule Based System

 Edge Detection is an important task for sharpening the boundary of images to detect the region of interest. This paper applies a linear cellular automata rules and a Mamdani Fuzzy inference model for edge detection in both monochromatic and the RGB images. In the uniform cellular automata a transition matrix has been developed for edge detection. The Results have been compared to the ...

متن کامل

Accounting for Boundary E ects in Nearest NeighborSearching 1

Given n data points in d-dimensional space, nearest neighbor searching involves determining the nearest of these data points to a given query point. Most average-case analyses of nearest neighbor searching algorithms are made under the simplifying assumption that d is xed and that n is so large relative to d that boundary eeects can be ignored. This means that for any query point the statistica...

متن کامل

High order perturbation study of the frustrated quantum Ising chain

In this paper, using high order perturbative series expansion method, the critical exponents of the order parameter and susceptibility in transition from ferromagnetic to disordered phases for 1D quantum Ising model in transverse field, with ferromagnetic nearest neighbor and anti-ferromagnetic next to nearest neighbor interactions, are calculated. It is found that for small value of the frustr...

متن کامل

FUZZY K-NEAREST NEIGHBOR METHOD TO CLASSIFY DATA IN A CLOSED AREA

Clustering of objects is an important area of research and application in variety of fields. In this paper we present a good technique for data clustering and application of this Technique for data clustering in a closed area. We compare this method with K-nearest neighbor and K-means.  

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997